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Abstract 

We reformulated the string formalism given by Aoyama, using an adjacent matrix of a 
network and introduced a series of generalized clustering coefficients based on it. Furthermore 
we numerically evaluated Milgram condition proposed by their article in order to explore q-th 
degrees of separation in scale free networks. In this article, we apply the reformulation to 
small world networks and numerically evaluate Milgram condition, especially the separation 
number of small world networks and its relation to cycle structures are discussed. Considering 
the number of non-zero elements of an adjacent matrix, the average path length and Milgram 
condition, we show that the formalism proposed by us is effective to analyze the six degrees 
of separation, especially effective for analyzing the relation between the separation number 
and cycle structures in a network. By this analysis of small world networks, it proves that a 
sort of power low holds between M n , which is a key quantity in Milgram condition, and the 
generalized clustering coefficients. This property in small world networks stands in contrast 
to that of scale free networks. 

keywords: small-world networks, scale free networks, generalized clustering coefficients, six 
degrees of separation 

1 Introduction 

Half a century ago, Milgram has found the phenomenon called "Six degrees of separation" [T] by 
making a social experiment. This experiment inspired studies of many researchers after that [2], [3], 
A series of examinations by him and his coworkers [3], [5] made a suggestion, which all people in USA 
are connected through about 6 intermediate acquaintances, more certain. Though there are some 
criticisms in their results, experiments to corroborate their experiments were attempted in various 
groups and theoretical discussions on them also were made [6], [7], [8]. At the end of the twenty 
century, some breakthroughs have come in the research of network theory such as a discovery of 
small world networks [9] .[10] and scale free networks pTj and so on [13], [14]. The understanding of 
the six degrees of separation deepened through such breakthroughs. The understanding of the 
phenomenon, however, is insufficient, especially how cycle structures in a network affect the six 
degrees of separation, more generally the separation number are still obscure. 

The first important study of the effect of cycle structures on the separation number in a network 
has been made by Newman [TS]. But he considered only the effects of the cycle structures with 
3 and 4 nodes, which are triangular and quadrilateral structures. We think that it is difficult 
to pursue the investigation further according to his consideration. Recently Aoyama et al.|16 
developed a method, "string formalism", that could generally analyze the subject. They proposed 
"Milgram condition" to analyze the the separation number. Their evaluation of the separation 
number, however, was made with a tree approximation in scale free networks throughout. Thus 
the effect of cycle structures on the separation number are not explored yet. 

We attacked this subject based on the string formalism by fusing it into adjacent matrix de- 
scription [17] , [18] , [19] , especially how cycle structures in networks affect the separation number q. 
By this, it became really possible to discuss up to just the six degrees of separation, while possible 
up to any degrees of separation in principle. This reformation easily make us extend the usual 
clustering coefficient [5] , which is an index of the number of triangles in a network, to a series of 
generalized clustering coefficients that give indices of the number of cycles with any nodes. We 
showed that six degrees of separation has a close relationship with the scale free network with the 



exponent 3 by applying this formulation to scale free networks 20], [21j . This result is attractive 
since most of scale free networks in the real world have about the exponent 3. 

In this article, we apply this formulation to small world networks. The separation number of 
small world networks and its relation to cycle structures are discussed. By comparing the analyses 
in this article with the results obtained in scale free networks, we show that there are crucial 
differences in the relation between the separation number and cycle structures in both networks. 
Through the considerations of the number of non-zero elements of an adjacent matrix, the average 
path length and Milgram condition, we show that the formalism proposed by us is effective to 
analyze the six degrees of separation, especially for analyzing the relation between the separation 
number and cycle structures in a network. This indicates that our formalism gives an appropriate 
methodology in network analyses. 



2 Reformation of String Formalism based on Adjacent Ma- 
trix 

In this section we review the formalism given in [17] , [18] , [19] , where the reformulation of the 
string formalism proposed by Aoyama;16j by the adjacent matrix is given. 

We consider a string-like part of a graph with connected j nodes and call it "j-string" following 
Aoyama. Let Sj be the number of j-string and Sj be the number of non-degenerate j-string on 
graph. The non-degenerate string is defined as the string that does not has any multi-edges and 
closed cycle structures as a subgraph in a string. We, however, consider strings homeomorphic to 
a circle, called closed strings, as the non-degenerate string. So Sj is the total number of the closed 
strings and the open strings that do not have any closed cycles within themselves. It is generally 
so difficult to calculate Sj and Sj , and would be impossible to practically calculate Sj with j > 7 
at present [TrJ] . 

By using the reformulation, we can represent the usual clustering coefficient which essentially 
counts the number of triangles in a network. Although there are some definitions of the clustering 
coefficient [15], [9], we adopt the global clustering coefficient C( 3 ) [H] defined by 

6 x number of triangles 6A3 . _ 

^ number of connected triplets S3 ' 

where A3 is the number of triangles in a network. But we need more indices in order to uncover 
the effect of general polygon structures in a network. From Eq.(l), we can generalize it to p-th 
generalized clustering coefficient C( p ) straightforwardly [17] . [18], [15] : 

q 2p x number of polygons 2pA p 

^ number of connected p-plets S p ' 

where A p is generally the number of polygons with p edges in a network. 

We reformulate the string formalism by utilizing an adjacent matrix A = (dy). By doing it, we 
succeed in this fused formalism to systematically evaluate Sj and so Cu,) ■ Generally the powers, 
A 2 , A 3 , A A ■ ■ ■ oi A give information as to respecting that a node connects other nodes through 
2, 3, 4, • • • intermediation edges, respectively. The matrix elements of A n indicates the multiplicity 
of the connectivity between two nodes, generally. So A n is not suitable for evaluating the number 
of non-degenerate strings Sj. For resolving the degeneracy of the multiplicity, we introduce new 
series of matrices i?"which give information as to the connectivity of two nodes, io and i n , through 
n intermediation edges without multiplicity. We could find that R n with n > 1 is given by the 
following formula [17], [13], [T5]; 



n o-<w 



rnni _ »fc,»j^ fc -tj>l , . 

[-K \ioin ~ / j a ioii a iii2 ' ' ' a i—l,i n fl _ A "\ ' 



Here the product of the Kronecker delta in the numerator plays role of excluding degeneracies 
strings or multiplicities and the Kronecker in the denominator is needed to keep closed strings, 
respectively. Though Eq.(3) makes one count Sj in a unified way, it is unrealistic to directly 
evaluate the elements of R n from its expression, since this contains multi-loop calculations coming 
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from symbol. Even if we expand Eq.(3), we have 2™(™~ 1 '/ 2 terms. This is 32768 for n = 6 that 
is needed to analyze six degrees of separation. Thus we can not evaluate R n within realistic time 
even in a little large networks. Careful expanding of Eq.(3), however, cause drastic cancellation 
among the terms. We could get some compact expressions for R n for n — 2 ~ 6 in the long run. 
We, however, still need to write down R e about a few pages in A4 size. The explicit expressions 
of R 2 ~ R 6 are described in the references [17], [18], [19]. 
By using R n , we obtain the expressions for S p and C( p y, 



5 p =X)(^~ 1 )ii/2. (4) 
TtRp 



C (P) = \ - ' ( 5 ) 



As an example, we obtain the following formula for the usual clustering coefficient; 



[R% = [A% f - [A 2 ] u S if , 11^1=2^, (6) 



TrR 3 _ TiA 3 



C (3) ~ x - .o, ; .o, : - TTTH mTT? ■ ( 7 ) 



3 Application to q-th degrees of separation 

3.1 Milgram Condition 

We analyze general q-th dedrees of separation, according to our formalism. For q-th degrees 
of separation, Aoyama proposed a condition, so-called Milgram condition at network size N [16 ; 

M q = ^~0{N), (8) 

where it is considered that the contribution of strings homeomorphic to a circle can be ignored at 
the limit of N — > oo. This condition means that the number of g-strings per node is nearly equal 
to the size of the considering network and gives a boundary whether q-th degrees of separation 
is fulfilled or not. This is a natural condition which indicates that a whole network are basically 
connected each other by q steps. As S q grows larger, more exactly M q grows larger under fixed 
N, q-th degrees of separation is easier to be fulfilled. S q in Eq.(8) can be calculated from Eq.(4) 
by using R n . We here place the focus on small- world networks. 



3.2 Application to Small World Networks 

Milgram condition has been applied to scale free networks in [2D], [21] already. In this article, 
we apply Milgram condition to small world networks and clarify the difference between the two 
types of networks. It is showm that the string formalism described by this article is available to 
analyze the relations between the separate number and cycle structures through the results. 

We construct small world networks with any rewiring ratio according to Newman- Watts model 
[2"2] . For it, we choose a regular lattice homeomorphic to one dimension circle S 1 with the degree 
4 as a basic network for constructing the networks. Let a be a rewiring parameter and network 
size is N = 200. Since the estimation of R n presents great computational complexity, it is difficult 
to evaluate R n for large n within realistic computational time. Even in this size, we find that CV3) 
and the average path length show characteristic behaviors of small world networks. (Fig. 1 and 
Fig. 4 shown later indicate these facts. ) 

We need to estimate CV3) ~ Ct$\ as indecis that reflect cycle structures in a network when we 
consider six degrees of separation. So we defined the following quantity; 

X P = EC( 9 ), (9) 

9=3 
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Figure 1: Normalized Xp. 




Figure 2: Milgram condition with q = 3 ~ 6. 



where q is taken from 3, since Cm and C12) do not participate in any cycle structures. X p take on 
the responsibility of information of cycle structures formed by nodes having the number of edges 
within p. 

Fig.l shows X p -log 10 a plot for p — 3 ~ 6. The vertical values are normalized by the maximal 
values of X p every p. A series of data in the top shown by the diamond mark are ones at p = 3 
and date plots in the lower parts represent ones at p = 4, 5, 6, respectively. It may be laid down as 
a general rule that X p behaves as the usual clustering coefficient Cm . The networks can be called 
"generalized small world networks" in that meaning. X$fi increase in some degree at a = where 
the network is a random network. This is because of the finite size effect by starting with rather 
small size circle in the network construction. As p grows larger, the variation in X p becomes smaller 
in Fig.l. This is naturally understood by noticing that there is no cycles with larger number than 
3 nodes in the initial regular network. 

To study the relation between X p and six degrees of separation, we find the relation between 
a and M p . log M q /N ever a is shown in Fig. 2. Four series of data represent data at q = 3, 4, 5, 6, 
respectively. The data within the red rectangle in Fig. 2 mean they satisfy Milgram condition. 
From this, we find that Milgram condition begin to be satisfied just when X p begin to decrease in 
Fig.l. Thus six degrees of separation is achieved by adding rather a small number of short cuts to 
the initial regular lattice. Though Fig. 2 also shows that while four degrees of separation can be 
achieve, it is difficult that three degrees of separation can be achieved even if a comes close to one. 

The consideration of an adjacent matrix also supports that this property is right. Let be r n the 
ratio of the number of the nonzero elements among all elements in A n . r n give information about 
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Figure 3: Numbers of non-zero elements in adjacency matrices. 
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Figure 4: Average path length in small-world networks with several a 



the ratio of nodes connected each other at n steps. When considering q-th degrees of separation, 
we need information about nodes connected each other within q steps, that is, n < q. We define 
T n as the sum of r n ; 

T q = J2r n . (10) 

Tt = l 

Tg-a plot is described in Fig. 3 where the data shown "sequence n" represent the data of T q+ \. There 
are some date with T q > 1 in Fig. 3, because there are situations which two nodes are connected 
each other at some different steps. When we choose T n = 0.5 as borderline that means two nodes 
are connected each other with the possibility of 50 percent including multi-connection, this line 
corresponds to the center of the region satisfying Milgram condition in Fig. 2 ( log M q /N ~ 1 ). The 
situation that two nodes are connected each other with the probability larger than 0.5 corresponds 
to the critical point of Milgram condition. 

We explore the relation between the average path length L and a. The relation is described with 
numerical data of L in Fig. 4. Fig. 4 shows that L rapidly decreases when a grows a little smaller 
as usual small world networks. Considering L ~ 10 conscious of the separation number almost 
< 10, it is consistent with the outcomes referred in Fig. 2 and Fig. 3. So three considerations given 
from Fig. 2, to Fig. 4 almost lead to similar conclusions on the separation number q = n. X p among 
them, however, plays an important role in the discussion on the relation between the separation 
number and cycle structures in a network. This could give a significant methodology to analyze 
them. 

We observe that it becomes easier to realize six degrees of separation or q becomes smaller, 
as the (generalized) clustering coefficient grows smaller from Fig.l and Fig. 2. We investigate this 
a little more quantitatively. Fig. 5 is the graph that represents the relation between \ogX n and 
log M n /N . Each striated data points corresponds to ones at n = 3 ~ 6. We observe that they are 
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almost arranged in straight lines. This means the relation between X p and M n is given by 

M n ~ B n {X n )-\ (11) 

where B n and a > are some constants that are determined by numerical data. This stands in 
contrast to the results of scale free networks shown in Fig. 6 where the relation between them were 

M n ~ exp(cX„), (12) 

where c > is a constants that are determined by numerical data|20j. |21j.. Comparing the signs of 
the parameters (a and c) appearing in Eq.(ll) and Eq.(12), the response of M n to X n is opposite in 
both the networks. While it becomes difficult for small world networks to satisfy Milgram condition, 
it becomes easy for scale free networks to satisfy the condition, as the generalized clustering 
coefficients becomes large. When there are many cycle structures in small word networks, released 
information from a node tend to turn round and round on same cycles so that the propagation of 
the information are seriously obstructed by cycle structures. Contrary to small world networks, 
cycle structures produce some shortcuts to make the propagation of information smooth in scale 
free networks, which are almost tree structures. Thus the separation number is not related to 
cycle structures in the same rule in all networks. The effect of cycle structures on q-th degrees of 
separation in a network strongly depends on the network topology used basically in constructing 
the network. 

Furthermore the relation between \ogX n and log M n /N lies along a line under fixed n and 
the gradient and the intercept on the y-axis of the line undergo a change with n in small world 
networks as shown in Fig. 5. This means that the way of realization of q-th degrees of separation 
depends on network topologies and is different in every n as cycle structures are changing |21). In 
scale free networks where the degree distribution P(k) is given by P{k) ~ fc~ 7 , however, the 
relation between logX n and log M n /N are universal to n when changing the network topologies 
by changing 7. A common straight line appears in logX n vs. M n /N graph of under diverse n. So 
the way of realization of q-th degrees of separation are common to every n in scale free networks. 

4 SUMMARY 

In this article, we investigate six degrees of separation in small world networks by using refor- 
mulation of the string formalism based on an adjacent matrix. This reformulation makes it possible 
for us to systematically evaluate the generalized clustering coefficient. Moreover we attacked the 
problem of general q-th degrees of separation based on a series of the generalized clustering coeffi- 
cients, especially how cycle structures on networks, whose information is charged with generalized 
clustering coefficients, affect the separation number q. Our previous studies support that this refor- 
mulation reconstructs the already known properties in scale free networks and random networks 23 
with the Poisson distribution in the degree distribution, and the formalism is reliable to analyze 
properties of networks [17] . [18] . The analyses by this formalism also uncovered the relation be- 
tween the exponent 7 and six degrees of separation. Especially, we found an interesting fact that 
the scale free network with 7 = 3.0, which many real-world networks have about this value of 
exponent, is closed to six degrees of separation [19], [20], |21) . 

As the result of this article, the general clustering coefficients behave like the usual clustering 
coefficient in small world networks. By considering Milgram condition for the separation number, 
we find that it rapidly gets easier to satisfy the condition by adding only a few edges to a one 
dimensional regular lattice homeomorphic to a circle. This aspect is similar to one of the average 
path length and is also supported from the perspective of an adjacent matrix. Thus the studies of 
Milgram condition, an average path length and an adjacent matrix give almost same information 
in a result. The string reformulation developed by us, however, can only evoke discussion in 
connection with cycle structures. It is a main assertion that the string formalism based on an 
adjacent matrix carries great significance in this article. 

By this analysis, it also proves that a sort of power low holds between M n and the generalized 
clustering coefficients. This property in small world networks stands in contrast to that of scale free 
networks. In small world networks, cycle structures operates as resistance to the propagation of 
information and the separation number rather decreases when there is not any cycles. This means 
that the effect of cycle structures on the separation number strongly depends on the construction 
method of networks or the basic properties of network. 
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Figure 5: Sum of C( p ) v.s.\ogM q /N in small world networks for every separation number n. 




To give a compact and systematic expression of R n for large n in order to study beyond six 
degrees of separation is a future research topic. It should be studied that the main results given 
by this article are also confirmed for larger size networks. Anyway extensive calculations and so 
highly efficient computer are needed to accomplish them. It is, however, confirmed by us that the 
essential behaviors do not depend on the network size up to a point in scale free networks. This 
fact is a natural consequence from the definition of the concept of " scale free" . 
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